Co-Dispersion: A Windowless Approach to Lexical Association

Author

  • Justin Washtell
Abstract

We introduce an alternative approach to extracting word pair associations from corpora, based purely on surface distances in the text. We contrast it with the prevailing window-based co-occurrence model and show it to be more statistically robust and to disclose a broader selection of significant associative relationships, owing largely to the property of scale-independence. In the process, we provide insights into the limiting characteristics of window-based methods, which complement the sometimes conflicting application-oriented literature in this area.

Similar resources

Lexical Bundles in English Abstracts of Research Articles Written by Iranian Scholars: Examples from Humanities

This paper investigates a special type of recurrent expressions, lexical bundles, defined as a sequence of three or more words that co-occur frequently in a particular register (Biber et al., 1999). Considering the importance of this group of multi-word sequences in academic prose, this study explores the forms and syntactic structures of three- and four-word bundles in English abstracts writte...

Solid Dispersion Approach Improving Dissolution Rate of Stiripentol: a Novel Antiepileptic Drug

Some drugs have low bioavailability due to their poor aqueous solubility and/or slow dissolution rate in biological fluids. Stiripentol (STP) is a novel anticonvulsant drug that is structurally unrelated to the currently available antiepileptics. It has poor aqueous solubility and its solubility has to be enhanced accordingly. Polyethylene glycol 6000 (PEG-6000) is commonly utilized as a hydrophilic...

A Comparison of Windowless and Window-Based Computational Association Measures as Predictors of Syntagmatic Human Associations

Distance-based (windowless) word association measures have only very recently appeared in the NLP literature and their performance compared to existing windowed or frequency-based measures is largely unknown. We conduct a large-scale empirical comparison of a variety of distance-based and frequency-based measures for the reproduction of syntagmatic human association norms. Overall, our results sho...

The Role of Self-Regulatory Approach in Iranian Learners' Lexical Segmentation: The case of authentic materials

The present research investigated the effect of self-regulatory approach (with two components of self-checking and self-efficacy) on pre-intermediate Iranian learners' lexical segmentation in listening comprehension via authentic listening comprehension texts. To achieve this purpose, the investigators administered an Oxford Placement Test (2007) to ninety-eight students of two girls’ private j...

Publication date: 2009